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NOVEL FLUORESCENT PROTEINS 
Field of invention 

The present invention relates to novel variants of the fluorescent protein GFP having im- 
proved fluorescence properties. 

5 Background 

The discovery that Green Fluorescent Protein (GFP) from the jellyfish A victoria retains 
its fluorescent properties when expressed in heterologous cells has provided biological 
research with a new, unique and powerful tool (Chalfie et al (1994). Science 263:802; 
Prasher (1995) Trends in Genetics 1 1:320; WO 95/07463). A very important aspect of 
10 using recombinant, fluorescent proteins in studying cellular functions is the non-invasive 
nature of the assay. This allows detection of cellular events in Intact, living cells. 

The excitation spectrum of the green fluorescent protein from Aequorea victoria shows 
two peaks: A major peak at 396nm, which is in the potentially cell damaging UV range, 
and a lesser peak at 475nm, which is in an excitation range that is much less harmful to 
15 cells. 

To improve the wild type GFP, a range of mutations have been described. Heim (GFP 
(Heim et al. (1994). Proc.Natl.Acad.Sci. 91:12501) described the discovery of a blue fluo- 
rescent variant which has greatly increased the potential applications of using fluorescent 
recombinant probes to monitor cellular events or functions, since the availability of probes 

20 having different excitation and emission spectra permits simultaneous monitoring of more 
than one process. However, the blue fluorescing variant described by Heim et al, Y66H- 
GFP, suffers from certain limitations: The blue fluorescence is weak (emission maximum 
at 448nm), thus making detection difficult, and necessitating prolonged excitation of cells 
expressing Y66H-GFP. Moreover, the prolonged period of excitation is damaging to cells 

25 especially because the excitation wavelength is in the UV range, 360nm - 390nm. 

Heim et al.(1995), Nature, Vol. 373, p. 663-4, discloses a Ser65Thr mutation of GFP 
(S65T) having longer wavelengths of excitation and emission, 490nm and 510nm, re- 
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spectively, than the wild-type GFP and wherein the fluorophore formation proceeded 
about fourfold more rapidly than in the wild-type GFP. 

Ehrig et al. (1995) FEBS Letters 367, 163-166, discloses a E222G mutant" of the Aequo- 
rea green fluorescent protein. This mutation has an excitation maximum of 481nm and an 
5 emission maximum at 506nm. 

Expression of GFP or its fluorescent variants in living cells provides a valuable tool for 
studying cellular events and it is well known that many cells, including mammalian cells, 
are incubated at approximately 37°C in order to secure optimal and/or physiologically 
relevant growth. Cell lines originating from different organisms or tissues may have differ- 
1 0 ent relevant temperatures ranging from about 35°C for fibroblasts to about 38°C - 39°C for 
mouse {3-cells. Experience has shown, however, that the fluorescent signal from cells ex- 
pressing GFP is weak or absent when said cells are incubated at temperatures above 
room temperature, cf. Webb, CD. et al., Journal of Bacteriology, Oct. 1 995, p. 5906-591 1 
Ogawa H. etal., Proc. Natl. Acad. Sci. USA, Vol. 92, pp. 11899-11903, December 1995 
15 and Lim et al. J. Biochem. 118, 13-17 (1995). The improved fluorescent variant S65T de- 
scribed by Heim et al. (1995) supra also displays very low fluorescence when incubated 
under normal culture conditions (37°C), cf. Kaether and Gerdes FEBS Letters 369 (1995) 
pp. 267-271. Many experiments involving the study of cell metabolism are dependent on 
the possibility of incubating the cells at physiologically relevant temperatures, i.e. tem- 
20 peratures at about 37°C. 

Thastrup et al. (1 997) EP 0 851 874 describes fluorescent proteins that exhibit high fluo- 
rescence in cells expressing them when said cells are incubated at a temperature of SOX or 
above. This is obtained with the amino acid in position 1 preceding the chromophore has 
been mutated. Examples of such mutations are F64L, F64I. F64V F64A and F64G. 

25 Various authors have experimented with combinations of mutations. One such combination 
is the F64L, S65T GFP (EGFP). EGFP exhibits high fluorescence when expressed at 30°C 
or above and has an excitation maximum at 488nm. 
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SUMMARY OF THE INVENTION 

The present invention provides novel fluorescent proteins, such as F64L-E222G-GFP that 
result in a cellular fluorescence far exceeding the cellular fluorescence when expressed at 
37°C and when excitated at 450 to 500nm compared to the parent proteins, i.e. GFP, the 
5 blue variant Y66H-GFP the S65T-GFP variant, and F64L-GFP. This greatly improves the 
usefulness of fluorescent proteins in studying cellular functions in living cells. 

It is shown that GFP mutated at the 64 position from F to L (F64L) and at the 222 position 
from E to G (E222G) has remarkable properties. It is first shown that the F64L.E222G- 
GFP has an entirely different spectrum than F64L.S65T-GFP (Example 2). In contrast, 
1 0 there is no substantial difference between folding characteristics (measured as the time 
when fluorescence is observed between the two GFPs, Example 3). Likewise, there was 
no difference between the pH sensitivity of the two GFPs (Example 4). The observed 
brightness of the E222G versus the S65T mutated F64L-GFPs is dependent on the test 
conditions (Example 5). 



15 DETAILED DESCRIPTION OF THE INVENTION 

One aspect of the present invention relates to a fluorescent protein derived from Green 
Fluorescent Protein (GFP) or any functional GFP analogue, wherein the amino acid in po- 
sition 1 preceding the chromophore has been mutated and wherein the Glutamic acid in 
position 222 has been mutated said mutated GFP has an excitation maximum at a higher 
20 wavelength compared to F64L-GFP and the fluorescence is increased when the mutated 
GFP is expressed in cells incubated at a temperature of 30°C or above compared to wild- 
type GFP. 



The excitation and emission characteristics of the F64L.E222G-GFP differ significantly 
from wild-type GFP and EGFP. Existing fluorescent proteins have demonstrated utility for 
research applications such as quantitative fluorescence microscopy (Patterson, G.H., et al 
(1997). BiophysicalJ. 73:2782-2790; Piston, D.W.,ef a/ (1999) Meth. Cell Biol. 58:31-48). 
It is now clear, however, that the optimal fluorescent protein characteristics for high- 
throughput screening (HTS) applications in drug discovery differ somewhat from those for 
research applications (Kain, S. R. (1999) Drug Discovery Today 4:304-312). For example, 
factors such as optimal and signal/noise are more important for HTS applications in drug 
discovery than are absolute brightness of probes such as fluorescent proteins. The 



WO 01/98338 



4 



PCT/EP01/06848 



10 



F64L.E222G-GFP described in this patent application has an excitation maximum of 470 
nm and an emission maximum of 505 nm (see Figure 3:), compared to the respective ex- 
citation and emission maxima of 490 nm and 510 nm for EGFP. This results in a Stokes 
shift of 35 nm for F64L,E222G-GFP, as compared to 20 nm for EGFP. This results in a 
5 significant increase in the excitation-emission band separation for F64L.E222G-GFP rela- 
tive to EGFP with several implications for the use of F64L.E222G-GFP in high-throughput 
screening. Some of these are listed below: 

1. The increased Stokes shift of F64L.E222G-GFP results in increased spectral resolution 
of its excitation and emission peaks. This enables more complete band separation us- 
ing a conventional dichroic beam-splitter, and decreased background signal for assays 
incorporating F64L.E222G-GFP relative to assays based on EGFP. 

2. F64L.E222G-GFP fluorescence can be excited by conventional light sources using nar- 
row band filters, or commercially available laser producing lines at 472 nm. In either 
case, the greater Stokes shift of F64l_E222G-GFP results in lower cross-talk from ex- 

1 5 citation light to the toe of the emission spectrum. 

3. The excitation maximum of F64L.E222G-GFP falls midway between those of the cyan 
fluorescent protein variant (ECFP, excitation max -433 nm) and the yellow fluorescent 
protein variant (EYFP, excitation max -513 nm). Because of this, it will allow for 
cleaner band separation when used together with those probes, and it is optimized for 
assay applications in which several GFP-labeled components will be multiplexed. 



20 



Many sources of GFPs exist. Examples are GFP derived from Aequoma victoria and GFP 
derived from Renilla. Various GFPs have been isolated from Ren///a examples are reni- 
formis and mufleri. As described in the examples and in SEQ ID NOs: 3 and 4, the chrc- 
25 mophore in Aequorea victoria is in position 65^7 of the predicted primary amino acid se^ 
quence of GFP. Thus, in a preferred embodiment the GFP is derived from Aequorea wc- 
toria. 

It is preferred that the mutation at F64 is a mutation to an aliphatic amino acid. Examples are 
F64L, F64I, F64V, F64A, and F64G, wherein the F64L substitution being most preferred. 
30 However other mutations, e.g. deletions, insertions, or post-translational modifications im- 
mediately preceding the chromophore are also included in the invention, provided that they 
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result in improved fluorescence properties of the various fluorescent proteins. It should be 
noted that extensive deletions may result in loss of the fluorescent properties of GFP. 

The E222G, E222A, E222V, E222L, E2221, E222F, E222S, E222T, E222N, E222Q sub- 
stitutions are preferred, the E222G substitution (that is substitution to Glycine) being most 
5 preferred. 

A preferred sequence of the gene encoding GFP derived from Aequorea victoria is dis- 
closed in SEQ ID NO: 3 (enhanced) and in SEQ ID NO: 7 G'elly fish). SEQ ID NO: 1 shows 
the nucleotide sequence of F64L-GFP with humanised codon. SEQ ID NO: 5 shows the 
nucleotide sequence of F64L-GFP with jellyfish codon. Besides, the novel fluorescent 
10 proteins may also be derived from other fluorescent proteins as mentioned above. 

Herein the abbreviations used for the amino adds are those stated in J. Biol. Chem. 243 
(1968), 3558. 

One aspect of the invention relates to a nucleotide sequence coding for the Fluorescent 
protein F64L-E222G-GFP. An example of such F64L-E222G-GFP is shown in list 2. In a 
1 5 preferred aspect the nucleotide sequence is in the form of a DNA sequence. 

The DNA construct of the invention encoding the novel fluorescent proteins may be pre- 
pared synthetically by established standard methods, e.g. the phosphoamidite method 
described by Beaucage and Caruthers. Tetrahedron Letters 22 {19811. 1859 - 1869, or 
the method described by Matthes et aL, EMBO Journal 3 (1984), 801 - 805. According to 
20 the phosphoamidite method, oligonucleotides are synthesized, e.g. in an automatic DNA 
synthesizer, purified, annealed, ligated and cloned in suitable vectors. 

The DNA construct may also be prepared by polymerase chain reaction (PCR) using spe- 
cific primers, for instance as described in US 4,683,202 or Saiki et al., Science 239 
(1988), 487 - 491 . A more recent review of PCR methods may be found in PCR Protocols . 
25 1990, Academic Press, San Diego, California, USA. 

The DNA construct of the invention may be inserted into a recombinant vector which may 
be any vector which may conveniently be subjected to recombinant DNA procedures. The 
choice of vector will often depend on the host cell into which it is to be introduced. Thus, 
the vector may be an autonomously replicating vector, i.e. a vector which exists as an ex- 
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trachromosomal entity, the replication of which is independent of chromosomal replication, 
e.g. a plasmid. Alternatively, the vector may be one which, when introduced into a host 
cell, is integrated into the host cell genome and replicated together with the chromo- 
some^) into which it has been integrated. 

5 The vector is preferably an expression vector in which the DNA sequence encoding the 
fluorescent protein of the invention is operably linked to additional segments required for 
transcription of the DNA. In general, the expression vector is derived from plasmid or viral 
DNA, or may contain elements of both. The term, "operably linked" indicates that the 
segments are arranged so that they function in concert for their intended purposes, e.g. 
10 transcription initiates in a promoter and proceeds through the DNA sequence coding for 
the fluorescent protein of the invention. 

The promoter may be any DNA sequence which shows transcriptional activity in the host 
cell of choice and may be derived from genes encoding proteins either homologous or 
heterologous to the host cell, including native Aequorea GFP genes. 

1 5 Examples of suitable promoters for directing the transcription of the DNA sequence en- 
coding the fluorescent protein of the invention in mammalian cells are the SV40 promoter 
(Subramani et al.. Mol. Cell Biol. ± (1981), 854 -864), the MT-1 (metallothionein gene) 
promoter (Palmiter et al., Science 222 (1983). 809 - 814) or the adenovirus 2 major late 
promoter. 

20 An example of a suitable promoter for use in insect cells is the polyhedrin promoter (US 
4,745,051; Vasuvedan et al., FEBS Lett. 311, (1992) 7 - 1 1), the P10 promoter (J.M. Vlak 
et al.. J. Gen. Viroloqy 69, 1988, pp. 765-776). the Autographa califomica polyhidrosis 
virus basic protein promoter (EP 397 485), the baculovirus immediate early gene 1 pro- 
moter (US 5,155,037; US 5.162,222). or the baculovirus 39K delayed-early gene promoter 

25 (US 5,155,037; US 5,162,222). 

Examples of suitable promoters for use in yeast host cells include promoters from yeast 
glycolytic genes (Hitzeman et al., J. Biol. Chem. 255 (1980). 12073 - 12080; Alber and 
Kawasaki. J. Mol. Appl. Gen, 1 (1982). 419 - 434) or alcohol dehydrogenase genes 
(Young et al., in Genetic Engine ering of Microorganisms fnr Hh^mir^ic (Hollaender et al, 
30 eds.), Plenum Press, New York, 1 982), or the JPLL (US 4.599.31 1 ) or ADH2-4c (Russell 
et al., Nature 304 (1983), 652 - 654) promoters. 
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Examples of suitable promoters for use in filamentous fungus host cells are, for instance, 
the ADH3 promoter (McKnight et al. f The EMBO J. 4 (1985), 2093 - 2099) or the teiA 
promoter. Examples of other useful promoters are those derived from the gene encoding 
A. oryzae TAKA amylase, Rhizomucor miehei aspartic proteinase, A. niger neutral ct- 
5 amylase, A. niger acid stable a-amylase, A. niger or A. awamori glucoamylase (gluA), 
Rhizomucor miehei lipase, A. oryzae alkaline protease, A. oryzae triose phosphate iso- 
merase or A, nidulans acetamidase. Preferred are the TAKA-amylase and gluA promot- 
ers. 

Examples of suitable promoters for use in bacterial host cells include the promoter of the 
1 0 Bacillus stearothermophilus maltogenic amylase gene, the Bacillus licheniformis alpha- 
amylase gene, the Bacillus amyloliquefaciens BAN amylase gene, the Bacillus subtilis al- 
kaline protease gene, or the Bacillus pumilus xylosidase gene, or by the phage Lambda 
P R or P L promoters or the E. coli lac, tre or tec promoters. 

The DNA sequence encoding the novel fluorescent proteins of the invention may also, if 
15 necessary, be operably connected to a suitable terminator, such as the human growth 
hormone terminator (Palmiter et al., o^ cjt) or (for fungal hosts) the TPI1 (Alber and Ka- 
wasaki, o^ cjt) or ADH3 (McKnight et aL, o^ dt) terminators. The vector may further 
comprise elements such as polyadenylation signals (e.g. from SV40 or the adenovirus 5 
Elb region), transcriptional enhancer sequences (e.g. the SV40 enhancer) and transla- 
20 tional enhancer sequences (e.g. the ones encoding adenovirus VA RNAs). 

The recombinant vector may further comprise a DNA sequence enabling the vector to 
replicate in the host cell in question. An example of such a sequence (when the host cell 
is a mammalian cell) is the SV40 origin of replication. 

When the host cell is a yeast cell, suitable sequences enabling the vector to replicate are 
25 the yeast plasmid 2\i replication genes REP 1-3 and origin of replication. 

The vector may also comprise a selectable marker, e.g. a gene the product of which com- 
plements a defect in the host cell, such as the gene coding for dihydrofolate reductase 
(DHFR) or the Schizosaccharomyces pombe TPI gene (described by P.R. Russell, Gene 
40, 1985, pp. 125-130), or one which confers resistance to a drug, e.g. ampicillin, kana- 
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mycin, tetracyclin, chloramphenicol, neomycin or hygromycin. For filamentous fungi, se- 
lectable markers include amdS . pyrG , arqB . niaD . sC. 

The procedures used to ligate the DNA sequences coding for the fluorescent protein of 
the invention, the promoter and optionally the terminator and/or secretory signal se- 
5 quence, respectively, and to insert them into suitable vectors containing the information 
necessary for replication, are well known to persons skilled in the art (cf., for instance, 
Sambrook et al., op.ciU . 

The host cell into which the DNA construct or the recombinant vector of the invention is 
introduced may be any cell which is capable of expressing the present DNA construct and 
10 includes bacteria, yeast, fungi and higher eukaryotic cells. 

Examples of bacterial host cells which, on cultivation, are capable of expressing the DNA 
construct of the invention are grampositive bacteria, e.g. strains of Bacillus, such as B. 
subtilis, B. licheniformis, B. lentus, B. brevis, B. stearothermophilus, B. alkalophilus, B. 
amyloliquefaciens, B. coagulans, B. circu/ans, B. lautus, B. megatherium or B. thuringien- 
1 5 s/s, or strains of Streptomyces, such as S. IMdans or S. murinus, or gramnegative bacte- 
ria such as Echerichia coli. The transformation of the bacteria may be effected by proto- 
plast transformation or by using competent cells in a manner known per se (cf. Sambrook 
et al., supra ). 

Examples of suitable mammalian cell lines are the HEK293 and the HeLa cell lines, pri- 
20 mary cells, and the COS (e.g. ATCC CRL 1650), BHK (e.g. ATCC CRL 1632, ATCC CCL 
10), CHL (e.g. ATCC CCL39) or CHO (e.g. ATCC CCL 61) cell lines. Methods of trans- 
fecting mammalian cells and expressing DNA sequences introduced in the cells are de- 
scribed in e.g. Kaufman and Sharp, J. Mot. Biol. 159 (1982), 601 - 621; Southern and 
Berg, J. Mol. Appl. Genet. 1 (1982), 327 - 341; Loyter et al., Proc. Natl. Acad. Sci. USA 79 
25 (1982), 422 - 426; Wigler et al., CeJ1 14 (1978). 725; Corsaro and Pearson, Somatic Cell 
Genetics I ( 1981 ). 603, Graham and van der Eb, Virology 52 (1973), 456; and Neumann 
et al., EMBO J. 1 (1982), 841 - 845. 

Examples of suitable yeast cells include cells of Saccharomyces spp. or Schizosaccha- 
romyces spp., in particular strains of Saccharomyces cerevisiae or Saccharomyces 
30 kluyveri. Methods for transforming yeast cells with heterologous DNA and producing het- 
erologous polypeptides therefrom are described, e.g. in US 4,599,311, US 4,931,373, US 
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4,870,008, 5,037,743, and US 4,845,075, all of which are hereby incorporated by refer- 
ence. Transformed cells are selected by a phenotype determined by a selectable marker, 
commonly drug resistance or the ability to grow in the absence of a particular nutrient, e.g. 
leucine. A preferred vector for use in yeast is the POT1 vector disclosed in US 4,931,373. 
5 The DNA sequence encoding the fluorescent protein of the invention may be preceded by 
a signal sequence and optionally a leader sequence , e.g. as described above. Further 
examples of suitable yeast cells are strains of Kluyveromyces, such as Hi lactis t Hanse- 
nula, e.g. H. polymorpha, or Pichia, e.g. P. pastoris (cf. Gleeson et al., J. Gen. Microbiol. 
132, 1986, pp. 3459-3465; US 4,882,279). 

1 0 Examples of other fungal cells are cells of filamentous fungi T e.g. Aspergillus spp., Neu- 
rospora spp., Fusarium spp. or Trichoderma spp., in particular strains of A. oryzae, A 
nidulans or A. niger. The use of Aspergillus spp. for the expression of proteins is de- 
scribed in, e.g., EP 272 277, EP 230 023, EP 184 438. 

When a filamentous fungus is used as the host cell, it may be transformed with the DNA 
1 5 construct of the invention, conveniently by integrating the DNA construct in the host chro- 
mosome to obtain a recombinant host cell. This integration is generally considered to be 
an advantage as the DNA sequence is more likely to be stably maintained in the cell In- 
tegration of the DNA constructs into the host chromosome may be performed according to 
conventional methods, e.g. by homologous or heterologous recombination. 

20 Transformation of insect cells and production of heterologous polypeptides therein may be 
performed as described in US 4,745,051; US 4,879,236; US 5,155,037; 5,162,222; EP 
397,485) all of which are incorporated herein by reference. The insect cell line used as the 
host may suitably be a Lepidoptera cell line, such as Spodoptera frugiperda cells or 
Trichoplusia ni cells (cf. US 5,077,214). Culture conditions may suitably be as described 

25 in, for instance, WO 89/01029 or WO 89/01028, or any of the aforementioned references. 

One aspect of the invention relates to a host transformed with a DNA construct according 
to any of the preceding aspects. The transformed or transfected host cell described above 
is then cultured in a suitable nutrient medium under conditions permitting the expression 
of the present DNA construct after which the cells may be used in the screening method 
30 of the invention. Alternatively, the cells may be disrupted after which cell extracts and/or 
supernatants may be analysed for fluorescence. 
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The medium used to culture the cells may be any conventional medium suitable for 
growing the host cells, such as minimal or complex media containing appropriate supple- 
ments. Suitable media are available from commercial suppliers or may be prepared ac- 
cording to published recipes (e.g. in catalogues of the American Type Culture Collection). 

5 in the method of the invention, the fluorescence of cells transformed or transfected with 
the DNA construct of the invention may suitably be measured in a spectrometer or a fluo- 
rescence microscope where the spectral properties of the cells in liquid culture may be 
determined as scans of light excitation and emission. 

One aspect of the invention relates to a fusion compound consisting of a fluorescent pro- 
10 tein (F64L-E222G-GFP ), wherein the (F64L-E222G-GFP ) is linked to a polypeptide. Ex- 
amples of such polypeptide is kinase, preferably the catalytic subunit of protein kinase A, 
or protein kinase C, or Erk1 , or a cytoskeletal element. 

The invention further relates to a process for preparing a polypeptide, comprising culti- 
vating a host according to any of the preceding aspects and obtaining therefrom the poly- 
15 peptide expressed by said nucleotide sequence. 

The various aspects of the invention have a plethora of uses. Some of these are de- 
scribed below: 

Use of F64L-E222G-GFP in an in vitro assay for measuring protein kinase activity, or 
dephosphorylation activity, or for measuring protein redistribution. 

20 Use of F64L-E222G-GFP as a protein tag in living and fixed cells. Due to the strong fluo- 
rescence the novel proteins are suitable tags for proteins present at low concentrations. 
Since no substrate is needed and visualisation of the cells does not damage the cells dy- 
namic analysis can be performed. 

Use as an organelle tag. More than one organelle can be tagged and visualised simulta- 
25 neously in living cells, e.g. the endoplasmic reticulum and the cytoskeleton. 

Use as a secretion marker. By fusion of F64L-E222G-GFP to a signal peptide or a peptide 
to be secreted, secretion may be followed on-line in living cells. A precondition for that is 
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that the maturation of a detectable number of novel fluorescent protein molecules occurs 
faster than the secretion. 

Use as genetic reporter or protein tag in transgenic animals. Due to the strong fluores- 
cence of the novel proteins, they are suitable as tags for proteins and gene expression, 
5 since the signal to noise ratio is significantly improved over the prior art proteins, such as 
wild-type GFP. 

Use as a cell or organelle integrity marker. By co-expressing two of the novel proteins, the 
one targeted to an organelle and the other expressed in the cytosol, it is possible to cal- 
culate the relative leakage of the cytosolic protein and use that as a measure of cell inte- 
10 grety. 

Use as a marker for changes in cell morphology. Expression of the novel proteins in cells 
allows easy detection of changes in cell morphology, e.g. blebbing, caused by cytotoxic 
agents or apoptosis. Such morphological changes are difficult to visualize in intact cells 
without the use of fluorescent probes. 

15 Use as a transfection marker, and as a marker to be used in combination with FACS 
sorting. Due to the increased brightness of the novel proteins the quality of cell detection 
and sorting can be significantly improved. 

Use as real-time probe working at near physiological concentrations Since F64L-E222G- 
GFP is significantly brighter than wild type GFP and F64L-GFP when expressed in cells at 

20 about 37°C and excited with light at about 490 nm, the concentration needed for visuali- 
zation can be lowered. Target sites for enzymes engineered into the novel proteins, e.g. 
F64L-E222G-GFP, can therefore be present in the cell at low concentrations in living 
cells. This is important for two reasons: 1 ) The probe must interfere as little as possible 
with the intracellular process being studied; 2) the translational and transcriptional appa- 

25 ratus should be stressed minimally. 

The novel proteins can be used as reporters to monitor live/dead biomass of organisms, 
such as fungi. By constitutive expression of F64L-E222G-GFP in fungi the viable biomass 
will light up. 
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Transposon vector mutagenesis can be performed using the novel proteins as markers in 
transcriptional and translational fusions. 

Transposons to be used in microorganisms encoding the novel proteins. The transposons 
may be constructed for translational and transcriptional fusions. To be used for screening 
5 for promoters. 

Transposon vectors encoding the novel proteins, such as F64L-E222G-GFP, can be used 
for tagging plasmids and chromosomes. 

Use as a reporter for bacterial detection by introducing the novel proteins into the genome of 
bacteriophages. 

10 By engineering the novel proteins, e.g. F64L-E222G-GFP, into the genome of a phage a 
diagnostic tool can be designed. F64L-E222G-GFP will be expressed only upon transfec- 
tion of the genome into a living host. The host specificity is defined by the bacteriophage. 

The invention is further illustrated in the following examples with reference to the appended 
sequence lists. 



1 5 Table 1 List of sequences 

Name Nucleotide SEQ ID NO: Protein SEQ ID NO: 



2 



e-F64L-GFP 1 
(PS399) 

e-F64L-E222G-GFP 3 4 
(PS699) 

jf-F64L-GFP 5 B 

(PS350) 

jf-F64L-E222G-GFP 7 8 
(PS1186) 
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Legend to Figures 

PS codes are explained in Table 2. 
Figure 1: 

Excitation spectra of PS1 189 (excitation maximum at 492 nm), PS1 1 91 (excitation maxi- 
5 mum at 468 nm), PS1 1 85 (excitation maximum at 490 nm) and PS1 1 86 (excitation maxi- 
mum at 473 nm). The emissions were recorded at 560 nm. The samples of PS1 189 and 
PS1 191 were 2-fold diluted and the samples of PS1 185 and PS1 186 were 10-fold diluted. 

Figure 2: 

Emission spectra of PS1189 (emission maximum at 509 nm), PS1191 (emission maxi- 
1 0 mum at 505 nm), PS1 1 85 (emission maximum at 51 0 nm) and PS1 1 86 (emission maxi- 
mum at 506 nm). Excitation was at 430 nm. The samples of PS1 189, PS1 191 and 
PS1 185 were 2-fold diluted and the sample of PS1 186 was 10-fold diluted. The curves for 
PS 1 189 and PS1191 relate to the primary y-axis whereas the curves for PS1 185 and 
PS1 186 relate to the secondary y-axis. 

1 5 Figure 3: 

Overlapping excitation (Ex) and emission (Em) spectra of PS1189 (panel A), PS1191 
(panel B), PS1 185 (panel C), and PS1 186 (panel D). The excitation curve to the left and 
the excitation curve to the right relate to the primary and secondary y-axis, respectively. 

Figure 4 

20 This figure shows the images collected after Lipofectamine 2000 transfection. 
eF64L,E222G (PS699) is at the top of the right column referred to as E222G, 
eF64L,S65T-GFP (PS279) is at the top of the left column referred to as EGFP. 

Figure 5 

Comparing the pH sensitivity of EGFP (PS279) and eF64L,E222G-GFP (PS699). 
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EXAMPLES 

Example 1: Construction of GFP plasmids 

Plasmids pEGFP-N1 (GenBank accession number U55762) and pEGFP-C1 (GenBank 
accession number U55763) both contain a derivative of GFP in which one extra amino 
5 acid has been added at position two to provide a better translational start sequence (a 
Kozak sequence) and so the total number of amino acids is increased by one to 239 in- 
stead of the 238 found in wildtype GFP. Therefore the denomination of mutations in GFP 
in these plasmids strictly should be referred to as e.g. F65L rather than F64L. However, to 
avoid this source of confusion and because the GFP community has adopted the num- 
10 bering system of wildtype GFP in its communications, the numbers used here conform to 
the commonly used naming of mutations in wildtype GFP. The relevant mutations in this 
respect are F64L, S65T, and E222G. 

Plasmids pEGFP-N1 and pEGFP-C1 contain the following mutations in the chromophore: 
F64L and S65T . The codon usage of the GFP DNA sequence has been optimized for ex- 
5 pression in mammalian cells. N1 and C1 refer to the position of multiple cloning sites rela- 
tive to the GFP sequence. 

To construct a plasmid combining F64L and E222G, pEGFP-N1 and pEGFP-C1 were first 
subjected to PCR with primers 9859 and 9860 described below. The primers are comple- 
mentary to the DNA sequence around the chromophore region and introduce a point mu- 
0 tation changing the threonine at position 65 to serine. In addition the primers introduce a 
unique Spe1 restriction site by silent mutation. The 4.7 kb PCR products were digested 
with Spe1, religated, and transformed into E.coli. The resulting plasmids are referred to as 
PS399 (N1 context) and PS401 (C1 context). These plasmids contain the chromophore 
sequence 64-LSYG-67. Plasmids PS399 and PS401 were subjected to Quick-Change 
5 mutagenesis (Stratagene) employing PCR with primers 0225 and 0226 described below. 
These primers are complementary to sequences near the C-terminus of the GFP and 
change glutamate at position 222 to glycine, and in addition they introduce an Avr2 re- 
striction site by silent mutation. The resulting plasmids are referred to as PS699 (N1 con- 
text) and PS701 (C1 context). They combine an LSYG chromophore with E222G with 
humanised codon and is referred to as eF64L,E222G (see sequence list 2) 



WO 01/98338 



PCT/EP01/06848 



15 

9859- top: S'-TGTACTAGTGACCACCCTGTCTTACGGCGTGCA-S 1 

9860- bottom: 5-CTGACTAGTGTGGGCCAGGGCACGGGCAGC-3' 

0225- bottom: 5-CCCGGCGGCGGTCACGAACCCTAGGAGGACCATGTGATCGCG-3' 

0226- top: 5'-CGCGATCACATGGTCCTCCTAGGGTTCGTGACCGCCGCCGGG-3 f 

5 A plasmid encoding a GFP directly derived from jellyfish with F64L (disclosed in figure 4 
of W097/1 1094,) was subjected to PCR with primers 9840 & 9841 described below. The 
PCR product was digested with restriction enzymes Age1 and Acc65 and ligated into 
pEGFP-N1 digested with Age1 and BsrG1. This replaces EGFP with F64L-GFP and in- 
troduces an amino acid change L236G near the c-terminus as a consequence of joining 
10 Acc65 and BsrG1 sites. This plasmid is referred to as PS350. 

A plasmid encoding a GFP directly derived from jellyfish with F64L, S65T (disclosed in 
figure 5 of W097/1 1094,) was subjected to PCR with primers 9840 & 9841 described be- 
low. The PCR product was digested with restriction enzymes Age1 and Acc65 and ligated 
into pEGFP-N1 digested with Age1 and BsrG1. This replaces EGFP with F64L, S65T- 
15 GFP and introduces an amino acid change L236G near the c-terminus as a consequence 
of joining Acc65 and BsrG1 sites. This plasmid is referred to as PS351. 

Plasmid PS350 was subjected to QuickChange PCR (Stratagene) with primers 0317 & 
0318 described below. This introduces E222G by mutation and an Avr2 restriction site by 
silent mutation. This plasmid is referred to as PS832. 

20 Plasmid PS832 was subjected to QuickChange PCR (Stratagene) with primers 0325 & 
0326 described below. This introduces L64F by mutation and a Psp1406 restriction site by 
silent mutation. This plasmid is referred to as PS845. 

A plasmid encoding a GFP directly derived from jellyfish (disclosed in figure 2a of 
WO97/11094) was subjected to PCR with primers 9840 & 9841 described below. The 
25 PCR product was digested with restriction enzymes Age1 and Acc65 and ligated into 
pEGFP-N1 digested with Age1 and BsrG1. This replaces EGFP with wildtype GFP and 
introduces an amino acid change L236G near the c-terminus as a consequence of joining 
Acc65 and BsrG1 sites. This plasmid is referred to as PS854. 
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Plasmid PS399 was subjected to QuickChange PCR (Stratagene) with primers 0327 & 
0328 described below. This introduces L64F by mutation and a Psp1406 restriction site by 
silent mutation. This plasmid is referred to as PS844. 

Plasmid PS699 was subjected to QuickChange PCR (Stratagene) with primers 0327 & 
5 0328 described below. This introduces L64F by mutation and a Psp1406 restriction site by 
silent mutation. This plasmid is referred to as PS846. 

9840- top: 5-GTACCGGTCACCATGAGTAAAGGAGAAGAAC-3' 

9841- bottom: 5'-TTATTGGTACCCTTCATCCATGCCATGTG-3' 

031 7-top: 5'-GAGATCACATGATCCTCCTAGGGTTTGTAACAGCTGCTGGG-3' 
10 0318-bottom: 5'-CCCAGCAGCTGTTACAAACCCTAGGAGGATCATGTGATCTC-3' 

0325- top: 5'-CCAACGCTTGTCACAACGTTTTCTTATGGTGTTC-3' 

0326- bottom: 5*-GAACACCATAAGAAAACGTTGTGACAAGCGTTGG-3* 

0327- top: 5'-CCCACACTAGTGACAACGTTTTCTTACGGCGTGC-3' 

0328- bottom: 5'-GCACGCCGTAAGAAAACGTTGTCACTAGTGTGGG-3 , 

1 5 Plasmids encoding GFPs in jellyfish codon context (PS350, PS351, PS832, PS845, 

PS854) were subjected to PCR with primers 1259 and 1260 described below. The ca 0.8 
kb PCR products were cut with restriction enzymes BspH1 and BamH1 , and ligated into 
E.coli expression vector pTrcHis (from Invitrogen) cut with Nco1 and BamH1. This places 
the GFPs under control of the ITPG-inducible promoter in the vector. The bottom primer 

20 1260 also changes the glycine at position 236 back to leucine. The resulting plasmids are 
referred to as PS1 184 (|f-F64L-GFP), PS1 185 (jf-F64L,S65T-GFP), PS1186 (jf- 
F64L.E222G-GFP), PS1187 (jf-E222G-GFP) and PS (jf-GFP). 

Plasmids encoding GFPs in humanised enhanced codon context (PS279 = pEGFP-N1 
(Clontech), PS399, PS699, PS844, PS846) were subjected to PCR with primers 1261 and 

25 1262 described below. The ca 0.8 kb PCR products were cut with restriction enzymes 
Nco1 and BamH1, and ligated into E.coli expression vector pTrcHis (from Invitrogen) cut 
with Nco1 and BamH1. This places the GFPs under control of the ITPG-inducible pro- 
moter in the vector. The resulting plasmids are referred to as PS 1 189 (e-F64L,S65T-GFP 
= EGFP), PS1190 (e-F64L-GFP), PS1191 (e-F64L,E222G-GFP), PS1192 (e-GFP) and 

30 PS1193(e-E222G-GFP). 
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1 259- top: 5'-GTTGTTTCATGAGTAAAGGAGAAGAACTTTTC-3 , 

1 260- bottom: 5-GTTGGATCCTTATTTGTATAGTTCATCCATG-3' 
1 261 -top. 5 -GTTGTTCCATGGTGAGCAAGGGCGAGGAGCTG-3' 
1 262-bottom: 5'-GTTGGATCCTTACTTGTACAGCTCGTCCATG-3' 

5 The plasmids described above were transformed into E.coli strain DH5a!pha (Life Tech- 
nologies). Single colonies were picked and grown overnight at 37C in LB medium con- 
taining 1mM IPTG. 0.5 ml cells were pelleted and stored at-20C until they were ana- 
lyzed. 



Table 2 Summary table of plasmids encoding GFPs with indicated amino acids at posi- 
10 tions 64 t 65 and 222. 



mammalian 
cell expres- 
sion piasmid 




Backbone- 
codon us- 
age 


aa 

pos 

64 


aa 

pos 

65 


aa 

pos 

222 


E. coli expression 
piasmid 


PS846 


e-F???G-GFP 


enhanced 


F 


S 


G 


PS1193 


PS844 


e-GFP 


enhanced 


F 


S 


E 


PS1192 


PS699 


e-F64L,E222G-GFP 


enhanced 


L 


S 


G 


PS1191 


PS399 


e-F64L-GFP 


enhanced 


L 


S 


E 


PS1190 


PS279 


EGFP 


enhanced 


L 


T 


E 


PS1189 


PS854 


jf-GFP 


jellyfish 


F 


S 


E 


PS1188 


PS845 


jf-E222G-GFP 


jellyfish 


F 


S 


G 


PS1187 


PS832 


jf-F64L,E222G-GFP 


jellyfish 


L 


s 


G 


PS1186 


PS351 


jf-F64L,S65T-GFP 


jellyfish 


L 


T 


E 


PS1185 


PS350 


jf-F64L-GFP 


jellyfish 


L 


S 


E 


PS1184 



Example 2: Determination of spectral properties of proteins EGFP and 
eF64L,E222G. 

Plasmids expressing EGFP from piasmid pEGFP-N1 (also referred to as PS279), and 
15 eF64L,E222G from piasmid PS699 were transfected into E.Coli TOP10 cells (Invftrogen) 
using lipofectamine 2000 (from Life Technologies) according to manufacturers recom- 
mendations. After 5 days cells were collected and resuspended in extraction buffer 50mM 
TRIS(pH8.0) with 1mM DTT. Cells were lysed by 3 cycles of freeze-thaw. Cell debris was 
centrifuged out at 10000g in acooled centrifuge. NaCI was added to 100rnM. 
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The cell pellets were resuspended in 1000 nl of H 2 0 each (2-fold dilution relative to vol- 
umes of pelleted cultures) and transferred to 1.0x0.5 cm plastic cuvettes and the following 
excitation and emission spectra were recorded on a Perkin Elmer LS50B luminescence 
spectrometer 

5 Excitation spectrum: 

Excitation at 350-525 nm (5 nm slit width) Emission 560 nm (10 nm slit width) 
Data presented in Figure 1. 

Emission spectrum: 

Excitation at 430 nm (1 0 nm slit width) Emission 450-550 nm (5 nm slit width) 
10 Data presented in Figure 2. 

Using the same settings, excitation and emission spectra of 10-fold (200 ^l of 2-fold di- 
luted cells mixed with 800 ^1 of water) diluted cells were recorded for the strongly fluores- 
cent samples expressed from cDNAs with jellyfish backbone (PS1 185 and PS1 186). 

In contrast to the expression levels, the fluorescence properties of the probes were inde- 
15 pendent of the codon usage. The spectra recorded for the probes with Thr65:E222 

(PS 11 85 and PS1 189) were very similar (excitation and emission maxima at 490-492 nm 
and 509-510 nm, respectively) and with Stokes shifts of 17-20 nm. Likewise, the spectra 
recorded for the probes with Ser65:G222 (PS1 186 and PS1 191) were very similar (exci- 
tation and emission maxima at 468-473 nm and 505-506 nm, respectively) and with 
20 Stokes shifts of 33-37 nm. 



Example 3: Determination of time to fluorescence of EGFP and eF64L,E222G 
in CHO cells. 

Three, 2 well chambers with CHOhIR cells were transfected with plasmid PS279 ex- 
pressing EGFP and plasmid PS699 expression eF64L,E222G using the Lipofectamine 
25 transfection method. 

Fluorescence from the cells was checked at regular intervals after transfection. 

Lipofectamine 2000 transfection method was used to transfect EGFP and eF64L,E222G 
in one, 8-well chamber with CHOhIR cells. Fluorescence from the cells was checked at 
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regular intervals after transfection as described above. Images were taken from the same 
cell fields at each interval. Three different fields were observed for each plasmid. The mi- 
croscope and camera settings were the same for each image. Optimal exposure time was 
taken from a chamber of cells with full EGFP expression (transfected 24 hours previously) 
5 to ensure the exposure does not saturate. The first images were taken from 45 minutes to 
1 hour post transfection, thereafter with a 30-minute interval for the first 7.5 hours post 
transfection and an image was collected 26.5 hours post transfection. Five different fields 
were observed for each plasmid. Fluorescence was detected no later then 4 hours post 
transfection. Fluorescence in eF64L ( E222G was detected in one field 2.5 hours post 
10 transfection. In the remaining fields, fluorescence was detected no later than 4 hours post 
transfection (Figure 4). 

Example 4: Comparing pH sensitivity over range pH 4.0 to pH 12.0 between 
EGFP and eF64L,E222G. 

Samples of semi-purified EGFP from PS279 and eF64L,E222G from PS699 proteins pro- 
1 5 duced in COS7 cell expression are tested for pH sensitivity over a range from pH 4.0 to 
pH 12.5, with 0.5 point intervals. Excitation and emission scans were taken of each pro- 
tein at the pH values of 4.0, 8.0, and 12.5. The results of the scans found EGFP's excita- 
tion max to be 490 nm and emission max to be 510 nm and eF64L,E222G s excitation 
max to be 475 nm and emission max to be 504 nm. Different pH values did not affect the 
20 excitation or emission max. Single reads were made with excitation at 470 nm, emission 
at 510 nm and with 10 nm slits. The results show no clear differences between EGFP and 
eF64L,E222G regarding pH sensitivity, except what could be due to random fluctuation 
(Figure 5). This experiment has been repeated with essentially same result 

Example 5: Comparison of relative brightness ofGFPs. 

25 10 plasmids were constructed which combine some of the following features: 

- F or L at position 64. 

- S or T at position 65. 

- E or G at position 222. 

- "jellyfish" or "humanised enhanced" GFP backbone. 
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The plasmids were transfected into CHO cells. One, two and four days later the cells were 
inspected visually in a fluorescence microscope by two people. The excitation was 475/40 
= blue light and the emission 510-560 = green light. Cells were scored on a "green" scale 
ranging from essentially black to extremely bright (Table 3). Results did not change much 



5 with time. 



Table 3 



Plasmid 


"greenness" 


GFP (* UVmax) 


codon context 


aa 64 


aa 65 


aa 222 


PS854 


black 


jf-GFP * 


jellyfish 


F 


S 


E 


PS845 


almost black 


jf-GFP-E222G 


jellyfish 


F 


S 


G 


PS846 


almost black 


e-GFP-E222G 


humanised 


F 


S 


G 


PS844 


almost black 


e-GFP * 


humanised 


F 


S 


E 


PS350 


light green 


jf-GFP-F64L * 


jellyfish 


L 


S 


E 


PS351 


green 


jf-GFP-S65T 


jellyfish 


L 


T 


E 


PS832 


green 


jf-GFP-F64L,E222G 


jellyfish 


L 


S 


G 


PS399 


bright green 


e-GFP-F64L * 


humanised 


L 


S 


E 


PS699 


very bright green 


e-GFP-F64L,E222G 


humanised 


L 


s 


G 


PS279 


very bright green 


EGFP 


humanised 


L 


T 


E 



The plasmids were also transfected into HeLa cells. After 24 hours transfection the cells 
were run on a FACS Calibur flow cytometer for characterisation of whole cell fluores- 
1 0 cence, with excitation at 488nm and emission viewed with fluorescence filter set 

530/30nm (range 515-545nm). 10000 events were collected for each transfection and 2 
replicates earned out for each construct. Average fluorescent intensities from the FACS 
analysis were obtained as geometric means (mean fluorescence on log scale) and results 
are shown in Table 4. 



WO 01/98338 PCT/EP01/06848 

21 



Table 4 



Plasmid 


FACS 


C5FP f* UVm?iv\ 


UUUUll OUF lie Al 




aa bo 


aa 


rOCHD 




if OPD COOOP 


jellyfish 


F 


S 


G. 


DOQC4 


o.o 


jT-Wt(j3rH 


jellyfish 


F 


S 


E 


roOOl) 




jf-BioGreen * 


jellyfish 


L 


S 


E 


PS846 


9.4 




iiurnanisea 


c 
r 


r> 
O 


\j 


PS832 


16.5 


jf-BioE222G 


jellyfish 


L 


s 


G 


PS351 


22.2 


jf-BioST 


jellyfish 


L 


T 


E 


PS844 


24.5 


e-wtGFP * 


humanised 


F 


S 


E 


PS399 


73.3 


e-BioGreen * 


humanised 


L 


s 


E 


PS699 


209.2 


e-BioE222G 


humanised 


L 


s 


G 


PS279 


421 


EGFP 


humanised 


L 


T 


E 



It is clear from the table above that, when expressed in the mammalian HeLa cell, the 
GFPs with humanised codon are far brighter than the GFPs with jellyfish codon. EGFP 
5 and e-BioE222G being the brightest. It is no surprise that EGFP is about twice as bright 
as E-BioE222G under these conditions. The excitation at the FACS is at 488nm, dose the 
excitation maximum of EGFP at 490nm. As illustrated in Table 5 below 97% of the emis- 
sion from EGFP will be picked up, whereas only 86% from the e-BioE222G. Furthermore, 
the difference between the intensity of EGFP and e-bioE222G when excited at the e- 
1 0 bioE222G excitation maximum of 470 is not as pronounced. 



Table 5 





PS1189 


PS1191 




PS1185 


PS1186 




eLTE 


eLSG 




ifLTE 


jfLSG 


Emission intensity with excitation at 470 nm 


131,4 


94,1 




155,0 


167,2 


Emission intensity with excitation at 488 nm 


148,1 


80,4 




178,2 


151,2 


Excitation max 


492 nm 


468 nm 




490 nm 


473 nm 


Emission intensity at excitation max 


152,9 


93,8 




183,3 


169,1 


Ratio: Em. intensity(488)/Em. intensity(max) 


0,97 


0,86 




0,97 


0,89 


Emission max 


509 nm 


505 nm 




510 nm 


506 nm 


Emission intensity at emission max 


71,2 


55,6 




444 


432 
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In mammalian cells enhanced GFPs were brighter than jellyfish GFPs. In E.Coli. jellyfish 
GFPs were brighter than enhanced GFPs. Thus, when it is worthwhile to choose the GFP 
backbone with care according to the subsequent host 
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CLAIMS 

1 . A fluorescent protein derived from Green Fluorescent Protein (GFP) or any functional 
GFP analogue, wherein the amino acid in position 1 preceding the chromophore has been 
mutated and wherein the Glutamic acid in position 222 has been mutated said mutated 

5 GFP has an excitation maximum at a higher wavelength and the fluorescence is in- 
creased when the mutated GFP is expressed in cells incubated at a temperature of 30°C 
or above compared to wild-type GFP. 

2. A fluorescent protein according to the preceding claim, wherein the chromophore is in 
position 65-57 of the predicted primary amino acid sequence of GFP. 

10 3. A fluorescent protein according to any one of the preceding claims, said protein being 
derived from Aequoria victorea or Renilla. 

4. A fluorescent protein according to any one of the preceding claims, wherein the amino 
add F in position 64 of the GFP has been substituted by an aliphatic amino acid. 

5. A fluorescent protein according to any one of the preceding claims, wherein the amino 
15 acid F in position 64 of the GFP has been substituted by an amino acid selected from the 

group consisting of L, I, V, A and G. 

6. A fluorescent protein according to any one of the preceding claims, wherein the amino 
acid F in position 64 of the GFP has been substituted by L. 

7. A fluorescent protein according to any one of the preceding claims, wherein the amino 
20 acid E in position 222 of the GFP has been substituted by an amino acid selected from 

the group consisting of G, A, V, L, I, F, S, T, N, and Q. 

8. A fluorescent protein according to any one of the preceding claims, wherein the amino 
acid E in position 222 of the GFP has been substituted by G. 

9. A fluorescent protein according to any one of the preceding claims having the amino 
25 acid sequence disclosed in SEQ ID NO: 4. 
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10. A fluorescent protein according to any one of the preceding claims having the amino 
acid sequence disclosed in SEQ ID NO: 8. 

1 1 . A fusion compound consisting of a fluorescent protein (GFP) according to any of the 
preceding claims, wherein the GFP is linked to a polypeptide. 

5 12. A fusion compound according to the preceding claim, wherein the polypeptide is a ki- 
nase, preferably the catalytic subunit of protein kinase A, or protein kinase C t or Erk1, or a 
cytoskeletal element. 

13. A nucleotide sequence coding for the Fluorescent protein of any of the preceding 
claims. 

10 14. A nucleotide sequence according to the preceding claim, shown in SEQ ID NO: 3. 

15. A nucleotide sequence according to the preceding claim, shown in SEQ ID NO: 7. 

16. A nucleotide sequence according to any of the preceding claims in the form of a DNA 
sequence. 

17. A host transformed with a DNA construct according to any of the preceding claims. 

15 1 8. A process for preparing a polypeptide, comprising cultivating a host according to any 
of the preceding claims and obtaining therefrom the polypeptide expressed by said nu- 
cleotide sequence. 

19. Use of the fluorescent protein according to any of the preceding claims in an in vitro 
assay for measuring protein kinase activity, or dephosphorylation activity, or for measur- 
20 ing protein redistribution. 
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Figure 1 
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Figure 2 
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Figure 3 
Panel A 
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Figure 3 
Panel B 
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Figure 3 
Panel C 
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Figure 3 
Panel D 
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Figure 4 
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Figure 5 
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SEQUENCE LISTING 

<110> Biolmage A/S 

<120> Novel Fluorescent Proteins 

<130> 25158PC1 
<160> 8 

<170> FastSEQ for Windows Version 3.0 

<210> 1 
<211> 720 
<212> DNA 

<213> Aequoria Victoria 

<220> 

<221> CDS 

<222> (1)...(717) 

<400> 1 

atg gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 4 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

15 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

gag ggc gag ggc gat gec acc tac ggc aag ctg acc ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc aca eta gtg acc acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ctg tct tac ggc gtg cag tgc ttc age cgc tac ccc gac cac atg aag 240 
Leu Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

cag cac gac ttc ttc aag tec gee atg ccc gaa ggc tac gtc cag gag 288 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 - — 95 

cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gec gag 336 
Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100' 105 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tac aac age cac aac gtc tat ate atg gec gac aag cag aag aac 480 
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Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtg cag etc gee gac cac tac cag cag aac acc ccc ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

ccc gtg ctg ctg ccc gac aac cac tac ctg age acc cag tec gec ctg 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
195 200 205 

age aaa gac ccc aac gag aag cgc gat cac atg gtc ctg ctg gag ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 
210 215 



220 



gtg acc gec gec ggg ate act etc ggc atg gac gag ctg tac aag 717 
Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 



taa 

<210> 2 
<211> 239 
<212> PRT 

<213> Aequoria Victoria 
<400> 2 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

1 5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

Leu Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin Hxs Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

10 ° 105 110 

Val Lys Phe Glu Gly Asp Thx Leu Val Asn Arg He Glu Leu Lys Gly 

115 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

13 ° 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
145 150 155 i 6 o 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

170 175 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp -Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Glu Phe 

210 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
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<210> 3 
<211> 720 
<212> DNA 

<213> Aequoria Victoria 

<220> 

<221> CDS 

<222> <1) . . . (717) 

<400> 3 

at 9 gtg age aag ggc gag gag ctg ttc acc ggg gtg gtg ccc ate ctg 4 8 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu 
1 5 10 15 

gtc gag ctg gac ggc gac gta aac ggc cac aag ttc age gtg tec ggc 96 
Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 
20 25 30 

g a 9 ggc gag ggc gat gec acc tac ggc aag ctg acc ctg aag ttc ate 144 
Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 
35 40 45 

tgc acc acc ggc aag ctg ccc gtg ccc tgg ccc aca eta gtg acc acc 192 
Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 
50 55 60 

ctg tct tac ggc gtg cag tgc ttc age cgc tac ccc gac cac atg aag 240 
Leu Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

cag cac gac ttc ttc aag tec gec atg ccc gaa ggc tac gtc cag gag 288 
Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 
85 90 95 

cgc acc ate ttc ttc aag gac gac ggc aac tac aag acc cgc gec gag 336 
Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 
100 105 110 

gtg aag ttc gag ggc gac acc ctg gtg aac cgc ate gag ctg aag ggc 384 
Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 
115 120 125 

ate gac ttc aag gag gac ggc aac ate ctg ggg cac aag ctg gag tac 432 
He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 
130 135 140 

aac tac aac age cac aac gtc tat ate atg gec gac aag cag aag aac 480 
Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
!45 150 155 160 

ggc ate aag gtg aac ttc aag ate cgc cac aac ate gag gac ggc age 528 
Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 
165 170 175 

gtg cag etc gee gac cac tac cag cag aac acc ccc ate ggc gac ggc 576 
Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 
180 185 190 

ccc gtg ctg ctg ccc gac aac cac tac ctg age acc cag tec gee ctg 624 
Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 
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205 



age aaa gac ccc aac gag aag cgc gat cac atg gtc etc eta ggg ttc 672 
Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Gly Phe 

Oin ^ n r- 



210 215 220 

gtg acc gec gec ggg ate act etc ggc atg gac gag ctg tac aag 
Val Thr Ala Ala Gly lie Thr Leu Gly Met Asp Glu Leu Tyr Lys 
225 230 235 

taa 

<210> 4 
<211> 239 
<212> PRT 

<213> Aequoria Victoria 
<400> 4 

Met Val Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu 

1 5 10 15 

Val Glu Leu Asp Gly Asp Val Asn Gly His Lys Phe Ser Val Ser Gly 

20 25 30 

Glu Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He 

35 40 45 

Cys Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr 

50 55 60 

Leu Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys 
65 70 75 80 

Gin His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu 

85 90 95 

Arg Thr He Phe Phe Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu 

100 105 HO 

Val Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly 

H5 120 125 

He Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Leu Glu Tyr 

130 135 140 

Asn Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Gin Lys Asn 
" 5 150 155 160 

Gly He Lys Val Asn Phe Lys He Arg His Asn He Glu Asp Gly Ser 

165 170 175 

Val Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly 

180 185 190 

Pro Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu 

195 200 205 

Ser Lys Asp Pro Asn Glu Lys Arg Asp His Met Val Leu Leu Gly Phe 

21° 215 220 

Val Thr Ala Ala Gly He Thr Leu Gly Met Asp Glu Leu Tyr Lys 
22 5 230 235 

<210> 5 

<211> 717 

<212> DNA 

<213> Aequoria Victoria 

<220> 

<221> CDS 

<222> (1) . . . (714) 

<400> 5 

atg agt aaa gga gaa gaa ctt ttc act gga gtt gtc cca att ctt gtt 
Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro He Leu Val 



717 



720 



48 
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1 5 10 15 

gaa tta gat ggc gat gtt aat ggg caa aaa ttc tct gtt agt gga gag 96 
Glu Leu Asp Gly Asp Val Asn Gly Gin Lys Phe Ser Val Ser Gly Glu 
20 25 30 

ggt gaa ggt gat gca aca tac gga aaa ctt acc ctt aaa ttt att tgc 144 
Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He Cys 
35 40 45 

act act ggg aag eta cct gtt cca tgg cca acg ctt gtc act act etc 192 
Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu 
50 55 60 



tct tat ggt gtt caa tgc ttt tct aga tac cca gat cat atg aaa cag 
Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 



65 70 75 



80 



tat aac tea cat aat gta tac ate atg gca gac aaa cca aag aat ggc 
Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Pro Lys Asn Gly 
145 150 155 160 

ate aaa gtt aac ttc aaa att aga cac aac att aaa gat gga age gtt 
He Lys Val Asn Phe Lys He Arg His Asn He Lys Asp Gly Ser Val 
165 170 175 



240 



cat gac ttt ttc aag agt gec atg ccc gaa ggt tat gta cag gaa aga 2 88 

His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 
85 90 95 

act ata ttt tac aaa gat gac ggg aac tac aag aca cgt get gaa gtc 336 
Thr He Phe Tyr Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 

aag ttt gaa ggt gat acc ctt gtt aat aga ate gag tta aaa ggt att 3 84 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg lie Glu Leu Lys Gly lie 
115 120 125 

gat ttt aaa gaa gat gga aac att ctt gga cac aaa atg gaa tac aat 432 
Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Met Glu Tyr Asn 
130 135 140 



480 



528 



caa tta gca gac cat tat caa caa aat act cca att ggc gat ggc cct 576 
Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 
180 185 190 

gtc ctt tta cca gac aac cat tac ctg tec acg caa-tect gee ctt tec 624 
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 

200 205 

aaa gat ccc aac gaa aag aga gat cac atg ate ctt ctt gag ttt gta 672 
Lys Asp Pro Asn Glu Lys Arg Asp His Met He Leu Leu Glu Phe Val 
210 215 220 

aca get get ggg att aca cat ggc atg gat gaa ggg tac aag 714 
Thr Ala Ala Gly He Thr His Gly Met Asp Glu Gly Tyr Lys 
225 230 235 

taa 71? 



<210> 6 
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<211> 238 
<212> PRT 

<213> Aequoria Victoria 
<400> 6 

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 

t" 5 10 15 

Glu Leu Asp Gly Asp Val Asn Gly Gin Lys Phe Ser Val Ser Gly Glu 

20 25 30 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe lie Cy 
35 40 



45 



Thr Thr Gly Lys Leu Pro Val Pro Trp Pro Thr Leu Val Thr Thr Leu 

50 55 60 

Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 
" 70 75 80 

Has Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 

85 90 95 

Thr He Phe Tyr Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 

100 105 ai0 

Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 

115 120 12 5 

Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Met Glu Tyr Asn 

Tyr Asn Ser His Asn Val Tyr He Met Ala Asp Lys Pro Lys Asn Gly 
It 150 155 16o 

He Lys Val Asn Phe Lys He Arg His Asn He Lys Asp Gly Ser Val 

165 170 175 

Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro He Gly Asp Gly Pro 

180 185 190 

Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
195 200 



205 



Lys Asp Pro Asn Glu Lys Arg Asp His Met He Leu Leu Glu Phe Val 

215 

Thr His Gly Met Asp Glu 
230 235 



Thr Ala Ala Gly He Thr His Gly Met Asp Glu Gly Tyr Lys 



<210> 7 
<211> 717 
<212> DNA 

<213> Aequovia Victoria 

<220> 

<221> CDS 

<222> (1) . . . (717) 

<400> 7 

Se? ^ ff* Ctt ttC aCt 993 3tt gtC cca att ctt 9tt 48 

Met Ser Lys Gly Glu Glu Leu Phe Thr Gly Val Val Pro lie Leu Val 

5 10 15 

Si 3 9at 9tt aat 999 Caa aaa ttc tct 9" a 9 fc 39* 9*9 96 

Glu Leu Asp Gly AS p val Asn Gly Gin Lys Phe Ser Val Ser lly Glu 

20 25 30 

mt of ^ taC 993 aaa Ctt acc ctt aaa ttt att tgc 144 

Gly Glu Gly Asp Ala Thr Tyr Gly Lys Leu Thr Leu Lys Phe He cys 

35 40 45 

Thr t£ Ifv W 9 r Cta 1^ CCa t99 CCa aC9 Ctt 9tc act act ctc 

Thr Thr Gly Lys Leu Pro Val Pro Xrp Pro xhr ieu Val Tlu _ Thr 

50 55 60 
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tct tat ggt gtt caa tgc ttt tct aga tac cca gat cat atg aaa cag 240 
Ser Tyr Gly Val Gin Cys Phe Ser Arg Tyr Pro Asp His Met Lys Gin 
65 70 75 80 

cat gac ttt ttc aag agt gcc atg ccc gaa ggt tat gta cag gaa aga 288 
His Asp Phe Phe Lys Ser Ala Met Pro Glu Gly Tyr Val Gin Glu Arg 
85 90 95 

act ata ttt tac aaa gat gac ggg aac tac aag aca cgt get gaa gtc 33 6 

Thx lie Phe Tyr Lys Asp Asp Gly Asn Tyr Lys Thr Arg Ala Glu Val 
100 105 110 

aag ttt gaa ggt gat acc ctt gtt aat aga ate gag tta aaa ggt att 384 
Lys Phe Glu Gly Asp Thr Leu Val Asn Arg He Glu Leu Lys Gly He 
115 120 125 

gat ttt aaa gaa gat gga aac att ctt gga cac aaa atg gaa tac aat 432 
Asp Phe Lys Glu Asp Gly Asn He Leu Gly His Lys Met Glu Tyr Asn 
130 135 140 

tat aac tea cat aat gta tac ate atg gca gac aaa cca aag aat ggc 480 
Tyr Asn Ser His Asn Val Tyr lie Met Ala Asp Lys Pro Lys Asn Gly 
145 150 155 160 

ate aaa gtt aac ttc aaa att aga cac aac att aaa gat gga age gtt 52 8 

He Lys Val Asn Phe Lys lie Arg His Asn He Lys Asp Gly Ser Val 
165 170 175 

caa tta gca gac cat tat caa caa aat act cca att ggc gat ggc cct 576 
Gin Leu Ala Asp His Tyr Gin Gin Asn Thr Pro lie Gly Asp Gly Pro 
180 185 190 

gtc ctt tta cca gac aac cat tac ctg tec acg caa tct gcc ctt tec 624 
Val Leu Leu Pro Asp Asn His Tyr Leu Ser Thr Gin Ser Ala Leu Ser 
195 200 205 

aaa gat ccc aac gaa aag aga gat cac atg ate etc eta ggg ttt gta 672 
Lys Asp Pro Asn Glu Lys Arg Asp His Met He Leu Leu Gly Phe Val 
210 215 220 

aca get get ggg att aca cat ggc atg gat gaa eta tac aaa taa 717 
Thr Ala Ala Gly He Thr His Gly Met Asp Glu Leu Tyr Lys * 
225 230 235 



<210> 8 
<211> 238 
<212> PRT 

<213> Aequovia Victoria 



<400> 8 



Met 


Ser 


Lys 


Gly Glu 


Glu 


Leu Phe Thr 


Gly 


Val 


Val 


Pro 


He 


Leu 


Val 


1 






5 






10 










15 




Glu 


Leu 


Asp 


Gly Asp 


Val 


Asn Gly Gin 


Lys 


Phe 


Ser 


Val 


Ser 


Gly 


Glu 








20 




25 










30 






Gly 


Glu 


Gly 


Asp Ala 


Thr 


Tyr Gly Lys 


Leu 


Thr 


Leu 


Lys 


Phe 


He 


Cys 






35 






40 








45 






Thr 


Thr 


Gly 


Lys Leu 


Pro 


Val Pro Trp 


Pro 


Thr 


Leu 


Val 


Thr 


Thr 


Leu 




50 








55 






60 










Ser 


Tyr 


Gly 


Val Gin 


Cys 


Phe Ser Arg 


Tyr 


Pro 


Asp 


His 


Met 


Lys 


Gin 
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65 

His Asp Phe Phe 

Thr lie Phe Tyr 
100 

Lys Phe Glu Gly 
115 

Asp Phe Lys Glu 
130 

Tyr Asn Ser His 
145 

lie Lys Val Asn 

Gin Leu Ala Asp 
180 

Val Leu Leu Pro 
195 

Lys Asp Pro Asn 
210 

Thr Ala Ala Gly 
225 



Lys 


70 










75 










80 


Ser 


Ala 


Met 


Pro 


Glu 


Gly 


Tyr 


Val 


Gin 


Glu 


Arg 


85 










90 










95 


Lys 


Asp 


Asp 


Gly Asn 


Tyr 


Lys 


Thr 


Arg 


Ala 


Glu 


Val 










105 










110 






Asp 


Thr 


Leu 


Val 


Asn 


Arg 


He 


Glu 


Leu 


Lys 


Gly 


He 








120 










125 




Asp 


Gly Asn 


lie 


Leu 


Gly His 


Lys 


Met 


Glu 


Tyr 


Asn 






135 










140 








Asn 


Val 


Tyr 


lie 


Met 


Ala 


Asp 


Lys 


Pro 


Lys 


Asn 


Gly 


Phe 


150 










155 










160 


Lys 


lie 


Arg 


His 


Asn 


He 


Lys 


Asp 


Gly 


Ser 


Val 


165 










170 






175 




His 


Tyr 


Gin 


Gin 


Asn 
185 


Thr 


Pro 


He 


Gly 


Asp 
190 


Gly 


Pro 


Asp 


Asn 


His 


Tyr 


Leu 


Ser 


Thr 


Gin 


Ser 


Ala 


Leu 


Ser 


Glu 






200 










205 








Lys 


Arg 


Asp 


His 


Met 


He 


Leu 


Leu 


Gly 


Phe 


Val 






215 










220 








lie 


Thr 


His 


Gly Met 


Asp 


Glu 


Leu 


Tyr 


Lys 








230 










235 
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